Accounting for Slipping and Other False Negatives in Logistic Models of Student Learning
نویسندگان
چکیده
Additive Factors Model (AFM) and Performance Factors Analysis (PFA) are two popular models of student learning that employ logistic regression to estimate parameters and predict performance. This is in contrast to Bayesian Knowledge Tracing (BKT) which uses a Hidden Markov Model formalism. While all three models tend to make similar predictions, they differ in their parameterization of student learning. One key difference is that BKT has parameters for the slipping rates of learned skills, whereas the logistic models do not. Thus, the logistic models assume that as students get more practice their probability of correctly answering monotonically converges to 100%, whereas BKT allows monotonic convergence to lower probabilities. In this paper, we present a novel modification of logistic regression that allows it to account for situations resulting in false negative student actions (e.g., slipping on known skills). We apply this new regression approach to create two new methods AFM+Slip and PFA+Slip and compare the performance of these new models to traditional AFM, PFA, and BKT. We find that across five datasets the new slipping models have the highest accuracy on 10-fold cross validation. We also find evidence that the slip parameters better enable the logistic models to fit steep learning rates, rather than better fitting the tail of learning curves as we expected. Lastly, we explore the use of high slip values as an indicator of skills that might benefit from skill label refinement. We find that after refining the skill model for one dataset using this approach the traditional model fit improved to be on par with the slip model.
منابع مشابه
Microsoft Word - Paskov-MachineLearningMethodsForBiologicalDataCuration.docx
Fulltext Figure 1: Feature set size. computational constraints. Figure 1 shows the distribution of dictionary sizes for each of our feature sets. We experimented with two data preprocessing steps: stemming and alias replacement. We used the porter2 stemming algorithm to stem all of the words in the corpus. We also wrote code to replace any biological terms or aliases with standardiz...
متن کاملThe Integration of Multi-Factor Model of Capital Asset Pricing and Penalty Function for Stock Return Evaluation
One of the main concerns of investors is the evaluation of the return on investment, which is conducted using various models such as the CAPM (single-factor model), Fama-French three/five-factor models, and Roy and Shijin’s six-factor model and other models known as multi-factor models. Despite the widespread use of these models, their major drawbacks include sensitivity to unexpected changes, ...
متن کاملOptimal and Worst-Case Performance of Mastery Learning Assessment with Bayesian Knowledge Tracing
By implementing mastery learning, intelligent tutoring systems aim to present students with exactly the amount of instruction they need to master a concept. In practice, determination of mastery is imperfect. Student knowledge must be inferred from performance, and performance does not always follow knowledge. A standard method is to set a threshold for mastery, representing a level of certaint...
متن کاملHow Effectiveness Of Comprehensive Performance Measurement Systems on Manager's Performance Through Modification of Mental Models (Learning Process)
One of the ways to reduce agency costs is to plan for the creation of effective decision-making information by designing appropriate comprehensive performance evaluation systems according to managers' learning process One of the important factors in the processing and classification of information for cognitive learning is mental models that are categorized in two dimensions of mental model co...
متن کاملFinancial Reporting Fraud Detection: An Analysis of Data Mining Algorithms
In the last decade, high profile financial frauds committed by large companies in both developed and developing countries were discovered and reported. This study compares the performance of five popular statistical and machine learning models in detecting financial statement fraud. The research objects are companies which experienced both fraudulent and non-fraudulent financial statements betw...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015